Efficient Disk Allocation for Fast Similarity Searching

Authors

  • Sunil Prabhakar
  • Divyakant Agrawal
  • Amr El Abbadi
Abstract

As databases increasingly integrate non-textual information, it is becoming necessary to support efficient similarity searching in addition to range searching. Recently, declustering techniques have been proposed for improving the performance of similarity searches through parallel I/O. In this paper, we propose a new scheme that provides good declustering for similarity searching. In particular, it performs global declustering as opposed to local declustering, exploits the availability of extra disks, and does not limit the partitioning of the data space. Our technique is based upon the Cyclic declustering schemes, which were developed for range and partial match queries. We establish, in general, that Cyclic declustering techniques outperform previously proposed techniques.
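To make the idea of cyclic declustering concrete, the following is a minimal sketch, not the authors' exact scheme. It assumes the commonly cited cyclic form in which a grid cell is assigned to disk (sum of skip_i * coordinate_i) mod M. The function names, the grid size, the skip values (1, 3), and the disk count 9 are all hypothetical choices made for illustration; the abstract does not state which skip values the paper uses. The sketch counts how many cells of a query cell's immediate neighborhood, a rough stand-in for the buckets a similarity search might touch, land on the same disk.

```python
from itertools import product


def cyclic_disk(cell, skips, num_disks):
    """Map a grid cell to a disk: (sum of skip_i * coord_i) mod num_disks."""
    return sum(h * x for h, x in zip(skips, cell)) % num_disks


def neighborhood(cell, grid_shape):
    """Return the cell plus every in-grid cell within distance 1 in each dimension."""
    ranges = [range(max(0, x - 1), min(n, x + 2)) for x, n in zip(cell, grid_shape)]
    return list(product(*ranges))


def max_load(cells, skips, num_disks):
    """Largest number of the given cells that fall on any single disk."""
    loads = [0] * num_disks
    for c in cells:
        loads[cyclic_disk(c, skips, num_disks)] += 1
    return max(loads)


# Hypothetical setup: an 8 x 8 grid, 9 disks, skip values (1, 3).
grid_shape = (8, 8)
skips = (1, 3)
num_disks = 9

cells = neighborhood((4, 4), grid_shape)
print(max_load(cells, skips, num_disks))  # prints 1: all 9 neighbors sit on distinct disks
```

With these illustrative parameters, the nine cells around (4, 4) map to nine different disks, so they could be fetched in a single parallel I/O step. Other skip values or disk counts can give worse spreads, which is why the choice of skips matters.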

Similar resources

Circular Data-space Partitioning for Similarity Queries and Parallel Disk Allocation

In a multiple-disk environment, it is desirable to have techniques for efficient parallel execution of similarity queries. Usually, many buckets that may contain the query result need to be retrieved from secondary storage, which is a costly operation. To achieve efficiency, there are two major factors that need to be considered. These are the number of buckets retrieved by the query, and the ...

Concentric Hyperspaces and Disk Allocation for Fast Parallel Range Searching

Data partitioning and declustering have been extensively used in the past to parallelize I/O for range queries. Numerous declustering and disk allocation techniques have been proposed in the literature. However, most of these techniques were primarily designed for two-dimensional data and for balanced partitioning of the data space. As databases increasingly integrate multimedia information in ...

The Trade-off Between Fast Learning and Dynamic Efficiency

In both static and dynamic, independent private values setups, the corresponding efficient allocation is implementable if the distribution of agents' values is known. Lack of knowledge about the distribution is inconsequential in the static case. But, if distribution of agents' values is not known in a dynamic framework, and if the designer gradually learns about it by observing present values, e...

Perfect Allocation Methods for Spatial Queries in Parallel Disk Systems

A disk-allocation method assigns a disk-id to each unit of spatial data. Allocating spatial data over multiple disks to distribute the I/O cost of query processing uniformly over available disks can tremendously speed up the processing. An allocation method is called perfect for a query set if it balances the I/O load on each disk in processing any query in a query set. Some of the interesting ...

Evaluation of Disk Allocation Methods for Parallelizing Spatial Queries on Grid Files

Spatial database systems are characterized by large amounts of geometric and geographic data. Query response times in these systems are crucial, since these systems are often used interactively for decision support. The Grid file [1] is a well-known spatial access method that has great potential for parallelism, which reduces the response time of spatial queries for time-critical on-line...

Publication date: 1997